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For  normal  random  walks  Si,  S2,  •  •  formed  from  independent,  identically  distributed 
random  variables  Xi,  Xj,  •  •  •,  we  determine  the  asymptotic  behavior  under  regularity  conditions 


of 


*  fro J 


V  » 


P(Sn>mg(n/m)  for  some  n  <  mj S^-  m£0,  U^~mlo),  £0  <  y(l), 

where  U m  —  X*  +  •  •  •  +  X^.  The  result  is  applied  to  a  normal  change-point  problem  to 
approximate  null  distributions  of  test  statistics  and  to  obtain  approximate  confidence  sets  for 
the  change-point.  ' 


'4 


1.  Introduction.  A  method  of  developing  approximations  for  boundary  crossing  prob¬ 
abilities  which  has  received  some  attention  of  late  is  that  of  writing  the  probability  as  an 
expectation  of  a  conditional  boundary  crossing  probability  given  an  appropriate  random  vari¬ 
able,  and  then  developing  an  approximation  for  the  conditional  probability.  Such  a  method 
has  been  used  with  some  degree  of  success,  as  measured  by  the  accuracy  of  the  approximations, 
by  Siegmund  (1982,  1985,  1986),  Hu  (1985),  and  James,  James,  and  Siegmund  (1985). 

Let  be  independent,  identically  distributed  N(n,<r2)  random  variables,  with 

Sn  =  Xi  H - h  Xn  and  Un  =  X2  H - h  X2.  Given  a  function  g(t),  0  <  t  <  1,  and  m  >  1,  let 

r  be  the  possibly  defective  stopping  time 

t  =  rm  =  inf  (n  >  1  :  Sn  >  mg  (n/m)} 


Siegmund  (1982)  studied  the  asymptotic  behavior  of  the  conditional  probabilities 

P(r  <  m  |  Sm  =  mft),  ft  <  tf(  1), 

and  used  the  results  to  approximate  the  tail  probability  of  the  Smirnov  statistic  and  the  power 
function  of  repeated  significance  tests  for  a  normal  mean  when  a2  is  known.  In  this  paper,  we 
extend  Siegmund’s  method  to  study  the  asymptotic  order  of  the  conditional  probabilities 

(1.1)  P(r  <  m  |  Sm  -  mft,  Um  =  mA0),  ft  <  y(l), 

and  apply  the  result  to  some  change-point  problems. 

Our  main  result  is  stated  and  proved  in  the  next  section,  after  some  preliminary  lemmas. 
The  proof  uses  a  likelihood  ratio  argument,  but  we  believe  the  result  could  also  be  obtained 
using  the  method  of  Woodroofe  (1982,  Chapter  8).  On  the  other  hand,  the  method  of  mixtures 
of  likelihood  ratios  (cf.  Lai  and  Siegmund,  1977)  and  the  method  of  Siegmund  (1985,  Theorem 
9.54;  see  also  Hu,  1985),  which  seem  particularly  simple  in  certain  related  problems,  appear  to 
be  difficult  to  adapt  to  the  present  situation. 

Our  motivation  for  studying  the  conditional  probabilities  (1.1)  comes  from  our  investiga¬ 
tion  of  the  following  change-point  problem:  Let  Xi,  •  •  • ,  Xm  be  independent  random  variables 
with  Xi  —  N{m,02),  and  suppose  we  wish  to  test  the  hypothesis  of  no  change  in  mean,  Ho  : 


|  Mi  =  •  ■  •  =  Mm,  versus  the  alternative  of  a  single  change,  H\  :  mi  =  •  •  •  =  #  Mj'+i  =  •  •  •  =  Mm 

for  some  y  €  {1, •  •  •, m  —  1}.  We  can  then  use  the  theorem  of  the  next  section  to  obtain  approx¬ 
imations  for  the  significance  levels  of  several  tests  of  Ho,  as  well  as  to  obtain  likelihood-based 
confidence  sets  for  the  change-point  j.  These  applications  are  given  in  Section  3. 


2.  Asymptotic  conditional  boundary  crossing  probabilities.  Throughout  this 
section,  the  following  assumptions  and  definitions  will  hold.  Xi,“-,Xn  are  independent, 
identically  distributed  normal  random  variables,  without  loss  of  generality  assumed  to  be 
JNT(0, 1),  with  Sn  =  Xi  +  •  •  ■  +  Xn  and  Un  =  AC*  +  •  •  •  +  X *,  n  =  1, 2,  ■  •  • ,  m.  The  real-valued 
function  g,  defined  on  (0, 1],  has  two  continuous  derivatives.  For  a  fixed  £o  <  y(l),  there  exists 
a  unique  point  t*  €  (0, 1)  which  minimizes  the  function 


M*)  = 


g(*)  -  Co* 


and  further  satisfies  h(t*)  >  0,  lim inft_0  h(t)  >  h(t*),  and  h"(t‘)  >  0.  The  stopping  time 
r  =  rm  is  defined  by 

r  =  inf  {n  <  m  :  Sn  >  mg  (n/m)}  ; 


we  let  r  =  +oo  if  the  defining  set  is  empty.  Let  Ao  be  such  that  A0  >  g2(t’)(t  Tl  +  i9(f)- 
£o}2(1  -  t*)”1,  and  define  m  and  it2  by  m  =  and  a2  =  A0  -  -  {g(t*)  - 

£o}2(1  -  Let  £  =  m£0  and  A  =  mAo-  Finally,  for  any  x  €  27?  and  y  >  0,  we  let 


(A)  =  P{A  \Sm  =  x,Um  =  y) 


for  A  belonging  to  the  <?-field  generated  by  Xi,  •  •  • ,  Xm. 

It  can  be  seen  that  a2  =  \o  -  -  h2{t*),  which  in  turn  implies  that  A0  >  fo  an^ 

^2/(Ao-Co)  <  1-  Note  also  that  the  condition  h'(t*)  =  0  implies  p-g'(t*)  =  (m~ Co)/{2(l  —  ^*)}- 
Since  A(t*)  >  0,  this  implies  m  ~  ^'(f*)  >  0-  It  can  also  be  shown  that  the  conditions  on  h 
imply  that  1  4-  2g"(i*)t*(l  -  t*){/i  -  g'{t*)}~1  >  0.  Thus,  the  terms  that  appear  in  (2.6)  in  the 
statement  of  the  theorem  below  are  all  well-defined,  with  the  factor  <r2/ (Ao  -  £2)  taking  on  a 
value  between  0  and  1. 


The  following  two  lemmas  are  technical  and  will  be  used  in  the  proof  of  the  theorem. 


Lemma  1.  Assume  am  — »  oo  with  am  =  o(m1'/2),  and  let  bm  =  m1'Jlogm  and  Im  =  (mt*  - 
dm  m1/2,  mt*  +  a  mm'/2).  The  following  bounds  all  hold  as  m  — *  oo: 

(a)  maxi<n<nt_i/,j”)(5n  >  mff  (n/m))  =0 

(b)  /&’  (S.  >  m,  (n/m))  =  0  {(»*/(*>  -  {3))(“‘>,/’}  i 

(0)  P<”>  (|r/m  -  f  |  >  =  »  {(*’/(*»  -  i 

(d)  P<”>  (|r/m  -  f  I  <  S,  -  mj  (r/m)  >»„)=<.{ (<r*/(Ao  -  {}))(—•)«};  and 

(e)  for  each  fixed  e  >  0,  uniformly  for  n  and  r  such  that  |n  -  mt*|  <  am  m1/2  and  0  <  r  <  bm, 

P(!x  (I Uri/m  ~  (<r2  +  M2)**|  >  «  I  Sn  =  mg  (n/m)  +  r)  =  o(l). 

Proof,  (a)  The  conditional  density  of  Sn  given  Sm  —  g  and  Um  =  A,  which  is  easily  obtained 
via  the  conditional  joint  density  of  S„  and  Un,  is  given  by 


,  ,,  |  c  u  -n-(  m  -  V/2  F((m  -_l)/_2) 

/5„(x  |  5m  -  Um~  X)  -  ^n(m  _  n) )  r((m  _  2)/2) 

A  £\  -(m-3)/2  /  _  (g  -  »)2  _  x2\  (m~4)/Z 
\  m)  V  m-n  n ) 


if  x2n-1  +  (g-x)2(m-n)_1  <  A  (and  =  0  otherwise) .  After  integrating  and  changing  variables, 


we  obtain 


Pg)  «.  >  ms  (»/m))  =  /„.(>  - 


'  xl/2r((m  -  2)/2)  JBn ^  y  ' 

where  Bn  =  {y  :  |y|  <  1,  y  >  (A0  -  £2)_1/JMn/  m)}.  Now  if  0  <  a  <  1  we  can  show,  by  a 
change  of  variables  (x  =  y2)  and  appropriate  bounding  of  the  integrand,  that 

(2.2, 

Stirling’s  formula  for  the  gamma  function  implies  T((m  -  l)/2)/r((m  -  2)/2)  ~  (m/2)1/2. 
Thus,  it  follows  from  the  fact  that  h(n/m)  >  A(t*),  together  with  (2.1)  and  (2.2),  that 

(2.3)  Pff  (S„  >  mg  (n/m))  <  K m"1/2  ~  g-"7—) ' 

for  some  K  >  0  and  all  m  >  3  and  n  such  that  A2 (n/m)  <  A0  -  £2  (the  bound  is  0  otherwise) 
Part  (a)  now  follows  from  the  relations  <72  =  A0  -  g2  -  A2(f*)  and  h(n/m)  >  h(t'). 


(b)  and  (c).  Note  that 


Pffl  (l r/m  “  *’l  >  <  ^2  Pff  (S„  >  mg  (n/m)) , 

n$Im 


and  that  (2.3)  implies 

(2.4)  ( Sn  >  mg  (n/m))  <  Km'1/2 


(a  2  \  {m~S)/*  (X0-  ?o2  -  h2(n/m)  \  ("*-3)/2 

\Xo-ti)  V  A0-g 5  -  **(*•)  / 


Both  (b)  and  (c)  will  follow  by  developing  bounds  for  appropriate  sums  of  the  last  factor 
above.  By  the  assumptions  on  h,  this  factor  will  be  of  exponentially  small  order  in  m  if  n/m 
lies  outside  any  fixed  neighborhood  of  t *;  thus,  for  any  fixed  6  >  0,  we  may  restrict  attention 
to  n  such  that  |n/m  -  t*|  <  S.  But  Taylor’s  series  expansions  on  log(l  +  x),  to  one  derivative, 
and  h(t)  around  t* ,  to  two  derivatives,  yield  the  existence  of  Kq  >  0  and  S  >  0  such  that 


/A0-^-h*(n/m)\ (m"3)/J 

V  X0-$-h*(r)  ) 


<  exp  1-mKo  (n/m 


for  n  such  that  |n/m  -  t’|  <  6.  Parts  (b)  and  (c)  follow  by  summing  these  bounds  over  n  in 
Im  and  and  bounding  the  sums  appropriately  by  integrals. 


(d)  By  a  process  similar  to  that  used  to  obtain  (2.4),  we  have  that 
P$  (| r/m  -  t*|  <  amm_1'/J,  5,  -  mg  (r/m)  >  6m)  <  ^  P$  ( Sn  >  mg  (n/m)  +  bm) 


and  for  some  K'  >  0,  all  m  >  3,  and  all  n  such  that  \h(n/m)  +  bm/ (n(m  -  n)}1^]1  <  Ao  -  & 

,2\  (m~3)/2 


{S*  >  m,{»M  +  6„)  <  Im-'» 

fl-  <-V 

V  crJmJ  / 


f  -2  \ 

(m-3)/2 

<  K'm -1/2  1 

~  e0  y 

f 

/  -2  > 

<  2  1 

f  a 

—  \ 

\X()  —  £q  / 

/ 

(m-S)/2 


where  the  last  inequality  uses  the  fact  that  h(n/m)  >  h(t*).  Part  (d)  now  follows  by  using  the 
relation  1  —  a  <  e~a  for  0  <  a  <  1. 


(e)  By  Markov’s  inequality 


(2.5) 


Pf?  {Wn/m  -  {<72  +  fl2)t2 1  >  <f  I  Sn  =  x)  <  ^ 


Var  (Un/ m  |  Sn  =  x,  Sm  =  £,  Um  -  A) 


+  {E(Un/m  |  5„  =  i,  Sm  =  $,  Um  =  A)  -  {a2  +  mV}* 


Conditionally,  Un  is  a  linear  function  of  a  beta-distributed  random  variable.  In  fact  it  can  be 
shown  that  the  random  variable 

Un  ~  Sl/n 


V  = 


Um-S*/n-(Sm-Sny/(m-n)’ 
whose  numerator  is  one  of  two  independent  chi-squareds  making  up  the  denominator,  has  a 
beta  distribution  with  parameters  (n  -  1) /2  and  (m  -  n  -  l)/2  and  is  independent  of  the  vector 
(Sn,  Sm,  Um).  Therefore, 


E{Un  |  Sn  =  *,  Sm  =  £,  Um  =  A)  =  -  +  (a  -  -  z)-  -  -) 

n  \  m  —  n  n )  (m  - 


- II 
2) 


and 


Var (Un  \Sn  =  x,  Sm  =  Um 


x1  N  2  (n  -  l)(m  -  n  -  1) 
n  )  m(m  -  2)2 


Part  (e)  now  follows  from  (2.5)  and  the  above  by  algebra.  I 

Lemma  2.  For  each  e  >  0, 

(a)  P(”x  i\sr/m  -  fit*\  >  e,  T  <  m)  =  o  |(ff2/(A0  -  ?o))(m_3)/2}  111(1 

(b)  P$  (| UT/m  -  [a2  +  /i2)t’\  >  €,  r  <  m)  =  o  { (<r2/(A0  -  ^))(m“3)/2}. 

Proof.  Let  am  —  logm.  Applying  first  Lemma  1(c)  and  then  the  triangle  inequality,  we  have 

LHS(a)  =  P ^  ^|ST/m  -  fit’ |  >  c,  |r/m  -  t*|  <  amm~1//2^  +  o 
<  (sT  -  mg(r/m )  >  |r/m  -  t*|  <  ^m'1''2) 

Kj  \  (m— 3)/2  1 

~r$)  J  • 

The  second  summand  on  the  right-hand  side  above  is  null  for  n  sufficiently  large,  since  g 
is  continuous  and  g(t ’)  =  fit’.  The  first  summand  can  be  handled  by  Lemma  1(d),  thus 
completing  the  proof  of  (a). 


.Ao-tf 


[m-3)/2' 


Apply  parts  (c)  and  (d)  of  Lemma  1  to  get 


LHS(6)  =  P^  {\UT/m  -  (a2  +  n2)tm |  >  e,  ST  -  mg  (r/m)  <  bm,  \ r/m  -  t'|  <  amm  1/2) 


Now  decompose  the  above  event  according  to  the  value  of  r,  letting  Im  be  the  interval  (mt*  - 
m1/2,  mt *  +  am  m1/2): 


LHS(6)  <  ^2  (| Un/m  -  (a2  +  /»*)**  |  >  «,  0  <  Sn  -  mg  (n/m)  <  bm) 


<  £  Pff  (Sn  >  mg  (n/m))  (| Un/m  -  (a2  +  /iJ)f*l  >  e  |  0  <  Sn  -  mg  (n/m)  <  bm) 

n€/» 


Part  (b)  now  follows  by  applying  first  Lemma  1(e)  and  then  Lemma  1(b).  | 

Remark  1.  We  will  show  in  the  course  of  the  proof  of  our  main  theorem  that  P^  (r  <  m)  > 
K{tr2/(Ao  ~  £o)}^m~3^2  for  some  K  >  0.  Then  Lemmas  1(c)  and  2  will  give  us  convergence 
of  r/m,  Sr/m,  and  UT/m  to  t*,  fit' ,  and  (a2  +  /i2)t*  in  conditional  P^ -probability  given 
(r  <  m},  i.e.  for  each  e  >  0 


Pff  (\T/m  -  «*|  >  e  I  r  <  m)  —  0, 

P\mx]  (\ST/m  -  /it* |  >  e  |  r  <  m)  —  0, 

and 

P^  (| UT/m  -  ( a 2  4-  /i2)t*|  >  e  |  r  <  m)  — ►  0. 

Remark  2.  Formula  (2.1)  shows  that  the  marginal  probability  P^(Sn  >  mg(n/m))  is  maxi¬ 
mized  by  that  n  which  minimizes  h(n/m),  i.e.  by  some  n  not  far  from  mt*.  When  m  is  large, 
then,  it  would  seem  reasonable  that  if  the  partial  sum  process  were  to  cross  the  curve  at  all, 
it  would  do  it  for  n  near  mt* .  We  see  from  Remark  1  that  this  holds. 


Theorem.  Let  u  be  the  function  defined  for  t  >  0  by 


v(t)  =  2 t  2  exp 


|-2^n-1$(-tn1/2/2)|  , 


where  $  is  the  standard  normal  distribution  function.  Then,  as  m  — »  oo, 


<r2  \(*- *)/*r  2<?"(t*)t*(l-t*)\"1/2 


1  + 


m  -  g‘{r) 


Remark  3.  The  function  v  can  be  evaluated  either  directly  by  numerical  computation  or 
approximately,  at  least  in  the  range  0  <  t  <  2,  from  the  local  expansion 


i/(t)  =  exp(-pt)  +  o(t2),  t->0, 


where  p  is  a  numerical  constant  which  is  approximately  equal  to  .583.  See  Siegmund  (1985, 
Ch.  X). 

Proof.  Let  Pz™}n  denote  the  restriction  of  to  the  tr-field  generated  by  Xi,  •  •  ■ ,  Xn.  Let 
=  mp  and  Aj  =  m(<r2  +  p2).  The  idea  of  the  proof  is  to  use  a  likelihood  ratio  argument, 
based  on  the  likelihood  ratio  of  P ^  with  respect  to  .  The  values  of  and  At  are  chosen 
because  of  the  approximately  equivalent  local  behavior  of  the  pre-r  process  under  P ^  A  and, 
conditionally,  under  P^1  given  {r  <  m}.  In  fact,  given  {r  <  m},  Remark  1  tells  us  that 
ST/r  — *  p  and  Ut/t  -*  <t2  +  p2  in  P^ -probability. 

Let  Ln  denote  the  likelihood  ratio  of  the  absolutely  continuous  part  of  P^  n  relative  to 
FiX  A  straightforward  calculation  shows  that  for  n  <  m  —  2, 


(  \  -Un-{Z-Sn)2/{m-n)  (X  x-  Zl/rn\{m'3)  12 

\Ai  —  Un  —  ($i  —  Sn)2/(m  —  n)  J  \*-Z2/m) 


if  Ai  -  Un  -  (Ci  -  S„)2/(m  -  n)  >  0  and  A  -  U„  -  (£  -  Sn)2/(m  -  n)  >  0,  and  Ln  -  0  if 
A  -  Un  -  (C  -  Sn)2/(m  -  n)  <  0  <  Ax  -  I7„  -  (Ci  -  Sn)2/(m  -  n). 


By  a  slight  generalization  of  Wald’s  likelihood  ratio  identity  (see  e.g.  Siegmund  1985, 


p.  13), 


(2.7) 


<  m  -  2)  =  /  LrdP^l  +  P<J({r  <  m  -  2}  n  Ac), 

J{r<m-2}nA 


where  A  —  {Ai  —  Ur  —  (£j  —  Sr)2/(m  —  r)  >  0}.  By  Lemma  1(a),  it  is  sufficient  to  show  that 
the  integral  in  (2.7)  is  asymptotically  equivalent  to  the  right-hand  side  of  (2.6)  and  the  final 
probability  in  (2.7)  is  of  smaller  asymptotic  order. 


(2.8) 


Upon  substitution  of  the  likelihood  ratio,  the  integral  in  (2.7)  becomes 
2  \  (m-3)/2 

I 

{r<m-J}nA 


Ao 


'  f  (  A  -Crr-(e-ST)V(m-r)  \{m~r~3)/2  (m] 

i(r<m-l}nA  \Ai  -  Ur  -  (£i  -  Sr)2/(m  -  r)/  {l,Al' 


Law  of  large  numbers  arguments  indicate  that  under  ,  as  m  — *  oo, 


r  P  5r  />  ,  UTp  2  2 

- ►  t  t - ►  /it  ,  and - *  (<r  +  a*  )t  , 

mm  m 


so  that 


(2.9)  m"1  (\-UT-  (l  -  £*),  m'1  ^Aj  -  UT  -  ^  <r2(l  -  f). 

Since  log(l  +  x)  =  x  +  0(x2)  as  r  -*  0,  we  have 

(2.10) 

A-Ur-($-5T)V(m-r)  rA  -  At  -  U2  -  -  25T($  -  ft)}/(m  -  r) 

Ai  -  Ur  -  (fj  -  5r)*/(m  -  r)  Pl  A!  -  UT  -  (6  -  ST)J/(m  -  r) 


+  Op(|Ao-aJ-M!- 


(£o  ~  M2  ~  2(^o  -  n)ST/m) 
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Letting  fZm  be  the  excess  over  the  boundary,  i.e.  Rm  —  ST  —  my(r/m),  and  using  a  Taylor 
expansion  on  g  at  t',  we  get 

(2.11)  ST  =  Rn  +  m  j$(t*)  +  g'(t')  (r/m  -  t*)  +  9--*  -  (r/m  -  £*)2  +  £  (r/m)  (r/m  -  U)2 1  , 

where  e(£)  — »  0  as  t  — ►  t*.  To  obtain  the  limitingjoint  distribution  of  Rm  and  m1/,2(r/m-£*)  we 
must  appeal  to  an  appropriate  nonlinear  renewal  theorem  for  the  conditional  process  governed 
by  , ■  For  411  intu^ive  discussion  of  nonliner  renewal  theory  which  leads  one  to  the  correct 
limiting  joint  distribution,  see  Siegmund  (1986,  Appendix  2  and  Lemma  2.16).  Hu  (1985, 
Chapter  4,  Theorem  10)  has  proved  a  general  result  which  provides  a  rigorous  justification. 
The  upshot  is  that  Rm  and  ml/,2(r/m  -  t*)  converge  in  distribution  and  are  asymptotically 
independent  under  ;  ^e  limiting  distributions  will  be  seen  below.  This,  together  with 


some  algebra,  means  that  the  right  hand  side  of  (2.10)  can  be  written  as 
2(6  -  M)(#m  +  mg"(r)(r/m  -  t')2/2  +  mf(r/m)(r/m  -  t')2) 


(1  -  r/m)(A1  -UT-  (6  -  Sr)2/(m  -  r)) 
If  we  insert  this  in  the  integrand,  (2.8)  becomes 


-  e0 


(m-3)/2 


'  {r<m-2}nA 


f  (m  -  r  -  3)(6  -  »){Rm  +  mp"(t*)(r/m  -  t*)2/ 2  +  op(l))  ,__n  )  JD(m) 

”P1 - (t  -  r/mHA,  -  (/-(£,  -  S,)V(m  -  0) - +' °'  ]  I 


Application  of  (2.9)  then  yields 

/  a2  x(^-3)/2  , 

(2.8)=  / 

£2  22)  'A°  ^°'  J{r<m-2}nA 

/  (6  -  A»)(f2m  +  mg" [t')(r / m  -  t*)2/ 2) 

expl - - 

It  follows  from  (2.9)  that 


PS.  (<•  <  m  -  2,  A,  -  [f,  -  >  °)  -  ',  m  — 

so  that  if  we  may  interchange  expectation  and  limit  in  (2.12),  we  will  be  able  to  evaluate  the 
order  of  (2.8)  by  using  Hu’s  result.  This  result  states  that  asm-*oo 

***■*,)-  *<*> '  A *  »>■ 

where  $  is  the  standard  normal  distribution  function  and  R *  is  the  excess  over  the  con¬ 
stant  boundary  c  of  a  random  walk,  generated  by  independent,  identically  distributed  N(n  - 
g'{t *),  <72)  random  variables,  which  13  stopped  the  first  time  it  exceeds  c.  If  R  is  a  random 
variable  such  that  R‘  —>  R  as  c  — • >  oo,  then  renewal  theory  (see  Siegmund,  1985,  Chapter  VIII) 


allows  us  to  calculate 


£'xp{|r^H{ 


2 (M -?'(**)) 


where  we  use  the  fact,  noted  earlier  in  this  section,  that  p  -  g'{t*)  =  (fx—  £o)/{2(l  -  **)}.  If  X 
has  a  chi-squared  distribution  with  one  degree  of  freedom,  the  remaining  factor  will  have  the 


5"(C)f(l  -  f) 


9'{n 


2g"{f)r(i-r)\  1/2 


Therefore,  we  will  be  able  to  conclude  that  the  right-hand  side  of  (2.6)  is  asymptoticaly  equiv¬ 
alent  to  (2.8)  if  we  can  make  the  exchange  of  expectation  and  limit  alluded  to  above. 


Fatou’s  Lemma  for  convergence  in  law  implies  that  the  right-hand  side  of  (2.6)  is  an 
asymptotic  lower  bound  for  (2.8).  By  Lemma  1(c),  / m  -  r*|  >  a**)  is  of  asymp¬ 

totically  smaller  order  than  this  if  ^  — ►  oo  and  =  o(mly,J).  The  analog  of  formula  (2.7) 
with  /m  —  t*|  >  a«}  in  place  of  {r  <  m  -  2}  then  implies  that 

J  LTdpg>i  =  oj(— ~yY  ]'  \. 

J {m1^,\r/m-f\>am}nA  ^  \  Ao  £o  /  J 

Therefore,  for  any  sequence  {am}  such  that  a,*  — ♦  oo  and  am  =  ©(m1/2), 

(*»— 3)/2 


(2.13) 


(2.8)  -(3—71)  / 

(  \-Ur-ti-ST)*/(m-r)  \  (m-r-S)/J  (m) 

Ux-Ur-i^-Sry/im-r)] 


From  Lemmas  1  and  2,  via  Remark  1,  we  see  that  for  all  e  >  0 
(2.14)  (W1  |av  -Ur-  -  cr2(l  -  f)  I  >  e  I  r  <  m  -  2) 


0, 


so  that  we  may  replace  the  event  A  in  the  above  by  {(Aj  —  UT  —  (^  -  ST)2/(m  -  r))/m  > 
<t2(1  -  t*)/2}.  We  may  use  the  fact  that  log(l  +  x)  <  x  if  r  >  0  to  obtain  bounds  for  the 
integrand  in  the  right-hand  side  of  (2.13)  in  the  region  of  integration: 


/  A  -  UT  -  ($  -  Sf)2/(m  -r)\  (m"r-3)/2 
Ul-Ur-fr-Srmm-T)) 


<  1  V  exp 


(m-r-3)  /A-Ax-{g2-g2-2ST($-fr)}/(m-r) 
2  V  Ax  -  Ur  -  tfi  -  Sr)2/(m  -  r) 


by(2.U) 
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1  V  exp 
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< 

1  V  exp 

2(m-r-3)($0-ji){flm  +  mg,,(t*)(r/m-t*)2/2  +  me(r/m)(r/m) -f)2} 


tr2(l  -  t*)(l  -  r/m) 

m(r/m  -  t*)2{($0  -  ii)g"{t*)  +  2jt(r/m)l>'l 
<r2(l  -  f) 


<  exp {Ka2m), 


for  some  K  >  0  and  m  large  enough  to  make  a m/m1/2  sufficiently  small.  We  may  then  choose 
a  suitable  sequence  {or*},  converging  sufficiently  slowly  to  +00,  to  allow  us  to  finish  the  proof 
of  the  asymptotic  equivalence  of  (2.8)  and  the  right-hand  side  of  (2.6). 


Equation  (2.14)  also  indicates  that  the  final  probability  in  (2.7)  is  of  smaller  asymptotic 
order,  because  it  implies  that  P^(Ae  |  r  <  m  -  2)  — *  0.  I 

As  an  aid  in  applying  the  theorem,  we  note  that  if  g  has  two  continuous  derivatives,  then 
the  other  conditions  on  g  and  h  will  be  satisfied  if  (i)  t*  is  the  only  point  at  which  h!  —  0,  (ii) 
>  0,  and  (iii)  h"(t *)  >  0.  This  will  be  the  case  in  the  examples  considered  in  the  next 
section. 


3.  Application:  Tests  and  confidence  sets  for  a  change-point. 

3.1.  Approximate  significance  levels.  Let  Xi,  i  =  l,---,m,  be  independent  normal 
random  variables  with  unknown  means  ^  and  constant  unknown  variance  a2  >  0.  We  consider 
testing  the  hypothesis  of  a  constant  mean  against  the  alternative  of  a  single  change-point,  i.e. 
testing  Hq  :  pi  =  •  •  •  =  versus  Hi  :  for  some  j  €  {1,  •  •  • ,  m  -  1},  m  =  •  •  •  =  Hj  ^  = 

’  '  '  =  A*m- 

We  will  focus  here  on  three  tests  considered  in  James,  James,  and  Siegmund  (1985):  the 
likelihood  ratio  test,  a  Studentized  version  of  a  score-like  test  due  to  Pettitt  (1980)  which  was 
introduced  for  the  case  of  known  variance,  and  a  modification  of  the  recursive  residuals  test 
proposed  by  Brown,  Durbin  and  Evans  (1975).  Approximations  to  the  significance  levels  of 
these  tests  were  given,  without  formal  justification,  in  James,  James,  and  Siegmund  (1985).  We 
will  now  show  how  the  theorem  of  the  last  section  can  be  used  to  obtain  these  approximations 
and  also  obtain  approximate  confidence  sets  for  the  change-point  j.  We  first  give  the  test 
statistics  for  the  three  tests  being  considered. 

The  generalized  likelihood  ratio  test  can  be  easily  shown  to  be  based  on  the  statistic 

l$n  ~  n*m| 

l<r»<m-i  {n(l  —  n/m)}1/2.?  ’ 

where  S„  =  Xx  +  •  •  +  Xn,  Xn  =  (Xt  +  •  •  •  +  Xn)/n,  and  S  =  {m~l  -  Xm)2}1/2.  In 

the  1985  paper  cited  above,  we  considered  the  larger  family  of  tests  based  on  statistics  of  the 
form 

rp  _  |^f»  ~  n^m\ 

1  mo<n<n»i  {n(l  —  n/m)}l/2S ' 

where  l<mo<mi<m-l.  The  use  of  T\  with  mo  >  1  and  mi  <  m  -  1  allows  one  greater 
power  than  that  of  the  likelihood  ratio  test  for  values  of  j  near  m/2,  while  giving  up  some 


power  for  small  and  large  j  where  the  change  is  difficult  to  detect  in  any  case.  If  mo  =  m  -  m1( 
then  the  test  based  on  T\  belongs  to  a  general  family  of  tests  considered  by  Deshayes  and 
Picard  (1984). 

The  Studentized  version  of  Pettitt’s  test  is  based  on  the  statistic 


Ti  =  max 

l<n<m— 1 


I  Sn  -  nX 


The  recursive  residuals  statistic  of  Brown,  Durbin  and  Evans  is  formed,  in  the  case  of 
known  variance,  by  accumulating  sums  of  standardized  residuals  Zn  of  the  X+i  from  the 
previous  means  X,  that  is 

Zn  =  {n/(n  +  l)}^(X+i  -  X),  n  =  1, 2,  •  •  • ,  m  -  1. 

The  Zn  are  independent  with  common  distribution  N(0,  a2)  under  Ho-  In  the  case  of  unknown 
variance,  the  accumulated  sums  are  Studentized  by  dividing  by  the  sample  standard  deviation 
of  the  Z' s.  In  James,  James,  and  Siegmund  (1985),  power  considerations  lead  us  to  study  the 
statistic  obtained  by  summing  the  standardized  residuals  “from  the  right,”  so  that  we  base  the 
recursive  residuals  test  on  the  statistic 

T3  =  max  -^L, 

where  S'n  =  Zm-i  H - 1-  Zm-„  and  S'  —  {(m  —  1)~1(Z2  -j - H  j)}1^2.  However,  as  long  as 

we  are  only  concerned  with  the  significance  level,  it  makes  no  difference  whether  the  recursive 
residuals  are  summed  from  the  right  or,  as  Brown,  Durbin,  and  Evans  proposed,  from  the  left. 
We  note  also  that  although  we  are  considering  this  very  simple  problem  of  a  change  in  mean, 
the  null  hypothesis  distribution  of  the  recursive  residuals  statistic  is  in  fact  the  same  as  in  the 
general  regression  model  of  Brown,  Durbin,  and  Evans,  the  only  difference  being  that  there  are 
m  —  p  recursive  residuals  Zn  instead  of  m  —  1,  where  p  is  the  number  of  regression  parameters. 

We  will  first  obtain  approximate  significance  levels  for  the  tests  based  on  T\  and  Ti .  Since 
the  distribution  of  the  process  {(Sn  —  njtm)/S,  n  =  0, 1,  ■  •  • ,  m}  does  not  depend  on  (p,  <x2) , 
the  process  is  independent  of  the  complete  sufficient  statistic  ( Sm,Um ),  by  Basu’s  theorem 


(Lehmann,  1959,  Theorem  5.2).  Therefore,  starting  with  the  easier  to  handle  T2  we  have 


P(T2  >  6)  =  P0(5(T2  >b)  =  P^mg^  15,1  >  b). 

If  we  assume  that  b  =  bm  ~  mg,  for  some  0  <  f  <  j,  then  we  can  apply  our  theorem  with 
g(t)  =  $ ,  =  0)  Ao  =  1,  and  t*  =  |  to  obtain 


_  □("»)(■ 


(m— 3)/2 


An  approximate  size  a  test  based  on  T2  can  now  be  obtained  by  using  as  critical  value  b  =  mg, 
where  g  makes  the  right-hand  side  of  (3.1)  equal  to  a. 

For  the  modified  likelihood  ratio  test,  we  have 
P(Tx  >  6)  =  pMpt  >  b) 

~  Pq  "n  (  max  T~i - ^"~T — 7777?  -  ^  • 

’  \*no<n<mi  {n(l  —  fl/m)}1/2  J 


Conditioning  with  respect  to  the  values  of  Sm,  and  Umi  and  using  the  Markov  property,  we 
have 

P(Ti  >  b)  =  P«g  (|Smi|  >  6{mj  (1  -  m1/m)}1/2) 

(3’2)  +  J  Ja  P&l)(r  <  mx)PoS(5ml  6  dx,  Umi  e  dy), 

where  r  =  inf[n  >  mo  :  Sn  >  6{n(l  -  n/m)}1/2]  and  Am  is  the  set  of  (x,  y)  such  that 
|x|  <  6{mx(l  -  mx/m)}1/2  and  the  P^ -joint  density  of  Smi  an d.Umi,  as  a  function  of  x  and 
y,  is  positive. 

The  first  summand  in  the  right-hand  side  of  (3.2)  can  be  calculated  exactly,  as  in  (2.1). 
The  theorem  can  be  used  to  approximate  the  integrand  in  the  second  summand.  For  this, 
assume  that  6  =  cm1?2,  m o  =  mto,  mi  =  mti,  z  =  mixo,  and  y  =  m iyo,  where  0  <  c  <  1, 
0  <  t0  <  <i  <  1,  |*o|  <  ctr1/l(l  “  and  miy0  is  a  P^-possible  value  of  Umi,  given 

Smi  =  m!Z0.  For  g{t)  =  c{ttj*x(l  -  tti)}1/2,  £0  =  *o  and  A0  =  yo,  we  obtain 

f 

c2(l-t1)2  +  z2t2- 

The  only  hitch  in  applying  the  theorem  is  that  r  >  mo;  a  direct  application  requires  mo  =  1. 
However,  this  is  no  problem  if  t*  >  to/ti,  because  in  this  case  it  follows  from  (2.4)  that  the 


Pi”1  ^probability  of  the  process’s  crossing  the  boundary  before  n  =  mo  is  of  exponentially 
smaller  order  than  that  of  crossing  before  n  -  mi,  so  that  we  may  replace  mo  by  1.  On  the 
other  hand,  if  t*  <  to/ti,  which  corresponds  to  |xo|  <  ctj*(l  -  ti){to(l  -  to)-1}1/2,  we  can 
approximate  the  integrand  by  0.  This  follows  again  from  (2.4),  which  implies  that  the  integrand 
will  be  of  exponentially  smaller  order  than  other  values  of  the  integrand  corresponding  to 
t*  >  to/ti .  Therefore,  we  are  led  to  the  approximation 

pv>  >- »)  ■  /  V 

■  o, 

\  mi  mi  / 

where  a1  —  <r2(x0,  j/to)  =  yo  -  c2tj 1  +  x§ti(l  -  ti)-1  and 

<I0<c(Lz£i)  ' }. 

The  factor  2  above  is  due  to  restriction  to  positive  values  of  x,  by  symmetry. 

A  further  approximaton  can  be  made  upon  insertion  of  the  conditional  density  into  the  in¬ 
tegral,  with  subsequent  utilization  of  the  fact  that  Umi  is  conditionally,  given  Sm  =  0,  Um  =  m, 
and  Sm,  =  miio,  a  linear  function  of  a  random  variable  with  a  beta  distribution  with  param¬ 
eters  (mi  -  l)/2  and  (m  -  mi  -  l)/2,  which  as  m  — »  oo  with  mi/m  — ♦  ti  collapses  to  a  point 
mass  at  ft-  Following  this  procedure,  we  can  insert  the  Pg^-density  of  (5m,/rni,  Cfm,/rni), 
to  wit 

-  -><* + ^ 

make  the  change  of  variable  (in  yo) 


z  =  ti(l  -  cJ)_1{yo  +  ZqM1  ~  <i)  “  c  *i  }» 


integrate  with  respect  to  z,  and  use  Stirling’s  formula  to  approximate  the  remaining  gamma 
functions,  to  show  that  the  double  integral  appearing  above  reduces  asymptotically  to  a  single 


integral  in  Xq.  Thus  we  are  led  to  the  approximation 


(3.3) 


P{Ti  >b)=*  (^j  V2  j\  1  -  x2)^-Wdz 


+ c (22) 1/2 (i - c^-o/*  ‘-1)/(1-e3)},/J iuL+ | *. 

V  »  /  x  [  (1  -C2)xJ 


re{(«0  l-I)/(l-«a)}1/l 
-!)/(!-<»)>»/> 


Remark  4.  It  is  easy  to  see  that  for  each  i  =  0, 1, • • • 

J\  1  -  x 2)("*“‘)/2di  =  (cm)_1(l  -  c2)("*"<+2)/2{i  +  m_1(i  -  1  -  c"J)  +  o(m-1)} 

as  m  — ♦  oo.  Use  of  this  approximation  simplifies  slightly  the  computational  burden  associated 
with  application  of  (3.3)  or  (3.4)  below.  From  this  expansion  it  is  evident  that  the  first 
term  on  the  right-hand  side  of  (3.3)  is  asymptotically  of  smaller  order  than  the  second  and 
mathematically  could  be  neglected.  In  a  number  of  related  problems  Siegmund  (1985)  shows 
numerically  that  including  this  term  typically  improves  the  approximation,  and  hence  we  have 
included  it  for  numerical  purposes. 

Remark  5.  It  is  natural  to  ask  what  precise  mathematical  meaning  can  be  attached  to  (3.3). 
As  noted  in  Remark  4,  the  first  integral  on  the  right  hand  side  of  (3.3)  is  asymptotically 
negligible.  With  some  additional  work  it  can  be  shown  that  P(T\  >  b )  is  asymptotically 
equivalent  to  the  second  integral  on  the  right-hand  side  of  (3.3).  It  suffices  to  show  that 
for  each  xo  €  (ct{"1(l  -  ti)Oo/(l  -  to)}1|/2,  c{(l  -  the  asymptotic  behavior  of  the 

conditional  probability  indicated  in  the  Theorem  holds  uniformly  for  yo  in  a  neighborhood  of 
1  -f-  c2^£‘1(1  -  ti)  -Xgti(l  -ti)-1  of  width  am/mll3,  where  — »  oo,  together  with  appropriate 
uniformity  in  Lemma  1.  The  details  are  tedious  and  have  been  omitted. 

The  procedure  in  studying  T3  is  similar  to  that  of  Tj.  Starting  with  sufficiency  arguments, 
we  have 

P(T3  >b)  =  P(T3  >  b  |  (S')2  =  1)  =  >  b(m  -  1)V*  |  (S')3  =  1) 

+  /  P(Ts>b  I  S^_x  =  x,  (S')2  =  1)P(5^_!  €  dx  I  (S')2  =  1). 

J-b(m-  1)»/J 

The  first  summand  on  the  right-hand  side  above  can  be  calculated  exactly  from  the  conditional 
density.  The  last  summand  can  be  approximated  by  using  the  theorem  to  approximate  the 
integrand.  In  this  case,  we  assume  6  =  c(m  -  l)1/2,  x  =  (m  -  l)x0,  and  mo  =  (m  -  l)to>  where 


0  <  c  <  1,  |io|  <  c,  and  0  <  to  <  1,  and  apply  the  theorem  with  g(t)  =  ct1!2 ,  £o  =  x0,  Ao  =  1, 
and  t*  =  x2/c2.  Calculations  similar  to  those  done  for  the  likelihood  ratio  test  then  lead  to 
the  approximation 


P(T3 


(3.4) 


>*)*{ 


2 (m  -  1) ) 1/2  f1 


+  c 


J  (l-x2)lm~*V2dx 

l4  _b*)(*-4)/1  / 

/  Je/ (l-c* 


'J 

-i)V/2 


.-1 


e/ll-ea)*/* 


v(x)dx. 


Some  comments  on  the  numerical  accuracy  of  the  above  approximations  can  be  found  in 
James,  James,  and  Siegmund  (1985). 


3.2.  Confidence  sets  for  the  change-point.  Our  theorem  can  be  used  to  obtain 
approximate,  likelihood-based  confidence  sets  for  j,  assuming  that  it  exists.  The  method 
extends  that  of  Siegmund  (1986,  §3.5),  who  considered  the  case  of  known  variance. 

Suppose  X\ ,  •  •  • ,  Xm  are  independent,  with  X\ ,  •  •  • ,  X,  independent,  identically 
distributed  N(ni,a2)  and  XJ+1)  •  •  • ,  Xm  independent,  identically  distributed  N(ni,cr2)  for 
some  1  <  j  <  m  —  1,  jk  P2,  and  a2  >  0  unknown.  To  test  H,  :  j  =  p  versus  Kp  :  j  p,  the 
likelihood  ratio  test  can  be  based  on  the  statistic 

m  Wk-Wp 

'  ix,-xm)2-w,’ 

where  Wk  =  ( Sk  -  kXm)2  /  {k(  1  -  k/m )}.  Under  H„,  the  distribution  of  Tp  depends  only 
on  p  and  6/cx  =  (/x2  -  ni)/v,  and  if  we  actually  wanted  to  perform  the  test,  or  obtain  a 
confidence  set  based  on  the  family  of  such  tests,  we  would  need  to  find  c  =  ca  such  that 
a  =  sup {Pp'S/oiTp  >  c)  :  S/t 7^0}.  This  problem  seems  impracticable.  However,  since  Tp  is 
stochastically  independent  of  Sm  we  do  have 

Pp,S/c(Tp  —  c)  =  Pp.i/aCPf  —  c  I  =  0), 


which  has  the  effect  of  eliminating  Xm  from  the  expression  for  Tp  and  putting  it  as  a  conditioner. 
In  the  spirit  of  Siegmund  (1986,  §3.5),  we  can  then  think  of  performing  the  test  conditionally 
by  conditioning  first  on  Sp,  which  eliminates  the  dependence  on  S/tr,  and  then  on  Up  and 


1  !/2 

■}  {k(l-k/m)y/> 


Um  -  Up.  That  is,  we  attempt  to  find  c  =  c(a,  p,  £,  A2,  A2)  such  that 
a  =  Pp(Tp  >c\Sp  =  f,Sm  =  0  ,UP  =  X1,Um  -Up  =  A2) 

(3  5)  =  P,(|S»|  >  {<  (a,  +  A,  -  +  7U^)'n  «.  - 

for  some  k  y*  p  |  Sp  =  £,  5m  =  0,  Up  =  Ai,  Um-Up  =  A2). 

A  likelihood  based,  level  1  -  a  confidence  set  C(X )  could  then  be  based  on  the  family  of  size 
a  tests  determined  by  (3.5),  by  defining 

C(X)  =L-.Tp<cL,p,Sp~  pXm,  £(Xk  -  Xm)\  f;  {Xk  -  Xm)2l  1 

{  V  *=i  fc=«+i  ;  J 

The  above  procedure  can  be  carried  out  by  using  the  theorem  to  obtain  approximate 
values  of  c(a,  p,  £,  Aj,  A2).  Assume  p/m  — >  p,  £  =  m£o,  Ai  =  mAio,  and  A2  =  mA2o,  where 
0  <  p  <  1,  £0,  A10,  and  A20  are  fixed.  Let  cq  =  (c(Ai0  +  A2o)  +  (1  -  c)CoP-1U  -  p)~1}.  Then 
the  Markov  property  implies 

RHS  (3.5)  =  pi  +P2  -  Pipi, 


where 


=  '  for 


some  k  <  p  |  Sp  =  £,  Up  =  Ax 


P2  =  Pm-p  |5*|  >  mc0 


k \l,/! 


{=(*-=)} 


for  some  k  <  m  -  p  \  Sm-P  =  Um-P  =  A2 


Applying  the  theorem  with  g(t)  =  co{t(l  -  pij/p}1/2,  we  obtain 


'"-‘•((wfer)  GAi) 


Co  V  P 


/_&_)  (  a  co  (  p  v/a 

Pi  V  \P<^2/  1*20  -  Co/  CoVl-P/ 

where  t{  -  ^p{c§(l  -  p)2  +  ^p2}-1,  t2  =  Cot1  “  P){coP2  +  Cot1  “  P*)}-1,  =  Aio  -  P~l  ' 

CoP(l  -  p)-1>  smd  <t\  =  A2q  -  e§(l  -  p)_l  -  ^(1  -  p)p-1  (<r2  and  cr|  are  assumed  >  0). 
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